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UNCLASSIFIED 


I 


convoluted  binomial  distributions,  that  is,  distributions  of  variables 
such  as  X,  are  characterized  in  terms  of  a system  of  differential  equa- 
tions satisfied  by  their  probability  mass  functions.  A monotonicity 
property  for  probability  ratios  of  certain  convoluted  binomial  distribu- 
tions is  noted,  and  the  maximum  likelihood  estimate  of  the  binomial 
parameter  is  shown  to  be  easily  obtained  for  these  distributions  using 
the  characterizing  property  established  for  their  mass  functions.  Results 
are  applicable  to  models  for  binomial  signals  in  noise  in  which  the  noise 
distribution  is  known  or  can  be  estimated  from  an  auxiliary  experiment. 

I.  INTRODUCTION 

Consider  a random  variable  X distributed  as  the  sum  of  independent 
components  Y and  Z.  Many  situations  in  which  a variable  such  as  X 
arises  have  the  characteristic  that  one  of  the  components  of  X,  say  Y, 
is  of  particular  importance  to  the  investigator,  while  the  other  component 
Z is  of  lesser  or  no  interest.  Typical  of  such  scenarios  is  the  process 
of  making  observations  with  a Gieger  counter,  where  the  observed  count  is 
the  sum  of  counts  due  to  the  presence  of  radioactive  material  and  counts 
due  to  noise  or  static.  We  refer  to  X as  a signal  plus  noise  variable, 
and  identify  the  component  of  X of  Interest  in  a particular  study  as 
the  signal  variable. 

^ _ 


Estimation  tor  signal  plus  noise  models  has  been  pursued  by  a 
number  of  authors.  Research  on  such  models  may  be  classified  into  two 
categories  --  those  which  postulate  that  the  noise  distribution  is  known 
and  those  that  do  not.  Among  investigations  of  the  first  type  is  the 
study  by  Gaffey  (1959),  in  which  a consistent  estimator  of  the  signal 
distribution  was  obtained  with  the  noise  variable  modeled  as  a normal 
variable  with  both  parameters  known.  Samaniego  (1976)  studied  maximum 
likelihood  estimation  for  Poisson  signals  in  discrete  noise  with  known 
distribution.  He  showed  that  the  maximum  likelihood  estimate  of  the 
Poisson  parameter  could  be  obtained  for  a broad  class  of  models  via  a 
characterization  result  established  for  convoluted  Poisson  distributions. 

The  assumption  that  the  noise  distribution  is  known  appears  quite  re- 
strictive, but  is  realizable  in  experimental  situations  in  which  the  noise 
variable  may  be  observed  alone.  In  such  situations,  the  empirical  dis- 
tribution of  the  noise  variable  may  serve  to  approximate  the  true  noise 
distribution  to  any  desired  degree  of  accuracy.  Estimation  for  models  in 
which  only  the  functional  form  of  the  noise  distribution  is  known  has  been 
studied  by  Sclove  and  Van.  Ryzin  (1969),  who  obtained  method  of  moments 
estimators  and  their  asymptotic  variances  for  a variety  of  models. 

We  treat  in  this  paper  maximum  likelihood  estimation  for  binomial 
signals  in  discrete  noise  with  known  distribution.  For  the  special  case 
of  the  Binomial -Poisson  model,  moment  estimators  for  the  parameters  appear 
in  Sclove  and  Van  Ryzin  (1969),  while  the  maximum  likelihood  estimate  of  the 
Poisson  parameter  when  the  binomial  parameter  is  known  may  be  obtained  from 
work  in  Samaniego  (1976).  The  complementary  problem  of  estimating  the 
binomial  parameter  by  maximum  likelihood  methods  when  the  Poisson  parameter 
is  known  is  treated  as  an  example  in  Section  III  of  this  paper. 
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Uinomial  sij;nals  in  noise  arise  in  a variety  oi  contexts.  i.xamples 
abound  in  the  literature  on  statistical  communication  theory.  Woodward 
(1953)  discusses  a communication  system  in  which  the  received  signal  is 
the  sum  of  independent  Bernoulli  variables.  Examples  of  recent  work,  on 
discrete  signal  detection  include  that  of  Grecnstein  (1974)  on  block 
coding  of  binary  signals  and  work  by  Crochiere  et  al.  (1976)  on  digital 
coding  of  speech.  Another  source  of  data  well  modelled  as  binomial 
signals  in  discrete  noise  is  the  area  of  acceptance  sampling.  The  number 
of  defectives  found  in  a sample  from  a large  population  is  usually  modelled 
as  a binomial  random  variable.  Inspection  policies  which  result  in 
systematically  overcounting  or  undercounting  the  number  of  defectives  in 
the  sample  give  rise  to  signal  plus  noise  distributions  for  either  the 
number  of  defectives  or  the  number  of  nondefectives  in  the  sample. 

Statistical  literature  contains  a variety  of  formulations  of  the 
notion  of  "almost  binomial"  data.  Among  the  most  widely  studied  relatives 
of  the  binomial  distribution  are:  truncated  binomial  distributions,  for 

which  moment  estimators  were  obtained  by  Rider  (1955)  and  Shah  (1966)  and 
for  which  maximum  likelihood  estimates  were  derived  by  Finney  (1949);  mixtures 
of  binomial  distributions  studied  by  Blischke  (1964);  censored  samples  from 
binomial  distributions,  for  which  Blight  (1970)  developed  useful  theory. 

A formulation  similar  to  that  studied  here  is  the  model  for  misciassification 
of  Bernoulli  data  studied  by  Bryson  (1965).  Models  for  binomial  signals 
in  discrete  noise  may  be  viewed  as  misciassification  models  in  which  the 
counting  mechanism  systematically  overcounts  binomial  data. 

In  Section  II,  we  establish  a characterization  result  for  convoluted 
binomial  distributions  in  terms  of  differential  equations  satisfied  by 
their  probability  mass  functions.  We  show  in  Section  III  that  the 
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characterization  result  may  be  fruitfully  applied  to  maximum  likelihood 
estimation  of  the  binomial  parameter  for  a 1 airly  broad  class  ol  sif^nal 
plus  noise  models. 

11.  CONVOLUTED  BINOMIAL  DISTRIBUTIONS 

Katz  (1946)  characterized  a trilogy  of  discrete  distributions  in 
terms  of  the  difference  equation 

(x+i)f(x+l)  = (a+bx)f(x)  (2.1) 

satis  V the  probability  mass  function  f.  He  showed  that  the  difference 

eq  acterized  Poisson  distributions  among  distributions  with  equal 

mean  and  variance,  while  it  characterized  binomial  (negative  binomial) 
distributions  among  distributions  with  mean  smaller  (larger)  than  its 
variance.  The  characterization  result  we  obtain  below  is  the  third  of  a 
similar  family  of  characterizations  --  these  in  terms  of  differential 
equations  satisfied  by  the  probability  mass  functions.  These  latter  results 
have  the  form 

S PeO^)  ■ - Pe'** 

where  represents  the  probability  mass  function  of  a variable  in  a given 
parametric  family.  The  result  established  here  characterizes  convoluted 
binomial  distributions  by  equations  of  the  form  (2.2).  The  complementary 
result  for  convoluted  Poisson  distributions  is  given  in  Samaniego  (197o), 
and  for  convoluted  Pascal  (negative  binomial)  distributions  in  Hannon  and 
Samaniego  (1977). 


\ 
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Oil  liTiiil  iai  equations  of  the  form  (2.2)  liavi’  bi'iii  i nvesl  i);a(  eil 
earlier.  Boswell  and  Patil  (1973)  obtained  characterizations  of  the 
Poisson,  binomial  and  negative  binomial  distributions  in  terms  of  tliesc 
equations.  Our  work  has  extended  their  results  to  convolutions  of  discrete 
distributions,  and  utilized  these  extensions  in  maximum  likelihood  estima- 
tion for  signal  plus  noise  distributions.  Our  characterization  of  con- 
voluted binomial  distributions  is  as  follows. 


Theorem.  Let  {)C^  ] be  a family  of  nonnegative  integer  valued  random 

N , p 

variables  indexed  by  p G (0,1)  and  N 6 {l,2,’**}.  Let  Z be  a discrete  varia- 
ble whose  distribution  does  not  depend  on  p.  Define  ^ - Z,  and  suppose 

that,  for  every  N,  X > Z as  p — 0.  Then,  P(X=k|N,p)  is  differentiable 

N , p 


in  p and 


~ P(X=klN,p)  = N[P(X=k-l|N~l,p)  - P(X=k|N-l,p) 


(2.3) 


for  all  N > 1,  p € (0,1)  and  k > 0 if,  and  only  if,  the  distribution  of 

X^  is  a convolution  of  the  binomial  distribution  with  parameters  N and 
N , p 

p and  the  distribution  of  the  random  variable  Z. 


Proof : We  denote  the  binomial  distribution  with  parameters  N and  p 

by  B(N,p).  It  is  easy  to  verify  that  if  the  family  {x^  ] consists  pre- 

cisely  of  sums  ^ + Z,  where  Y and  Z are  independent  and  Y ^ B(N,p), 
then  the  probability  mass  function  of  X^  satisfies  (2.3)  for  all  appropriate 
N,  p and  k.  We  prove  the  converse  by  induction  on  N.  Suppose  (2.3) 
is  satisfied  for  N > 1,  p € (0,1)  and  k > 0.  Consider  N = l.  Let  k be  a 
fixed  but  arbitrary  nonnegative  integer.  Wo  have  by  (2.3)  that 


1 

j 

! 
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P(X=k|l,p)  = P(X=k-llO,p)  - P(X=klO,p) 


= P(Z=k-l)  - P(Z=k) 


The  general  solution  of  this  differenLial  equation  is 

P(X=k|l,p)  = p(P(Z=k-l)  - P(Z=k))  + 

where  is  a constant  independent  of  p.  The  boundary  conditon 
X 

X^  ■*  Z as  p "•  0 identifies  c as  P(Z=k),  so  that 
i , p k 

P(X=kU,p)  = pP(Z=k-l)  +(l-p)P(Z=k). 

Thus,  X,  has  the  distribution  of  Y,  + Z,  where  Y and  Z are 
1,P  1,P 

independent  and  Y ~ B(l,p).  We  now  assume  that  for  N <'  N*,  X^  ^ 

has  the  distribution  of  the  sum  Y.,  + Z,  where  Y and  Z are 

N,p 

independent  and  Y ~ B(N,p).  For  N=N*,  we  have  for  each  fixed  k 


P(X=k]N*,p)  = N*[P(X=k-l]N*-l,p)  - P(X=k|N*-l,p)]  (2.4) 


By  the  induction  hypothesis,  we  may  replace  the  probabilities  on 
the  right  hand  side  of  (2.4)  by  the  probabilities  of  the  appropriate 
convoluted  binomial  distribution.  Equating  the  indefinite  integrals 
of  both  sides  of  (2.4)  yields  the  general  solution 

min(k,N*)/^^\  ^ N*-i 

P(X=k|N*,p)  = E p (l-pl  P(Z=k-i)  + c 

i=0  ' ^ 


where  c^^  Is  a constant  independent  of  p.  Invoking  the  fact  that 
£ 

X„.  -*  Z as  p -•  0,  we  have  c,  =0.  Thus,  X,,,  has  the  distribution 

N*,p  r . 


of  I /.  wlu  Ti'  Y aiui  /.  arc'  inclrpc-ndcnl.  and  Y ~ IKN-'  ,p),  cccinph  I in;; 


tiu'  proc'f. 


Convoluted  binomial  distributions  characterized  above  may  be  viewed 

as  mixtures  of  shifted  binomial  distributions.  If  X ~ 15  (N,p)  is  under- 

<x 

stood  to  mean  that  X - a ~ f5(N,p),  then  the  convolution  of  I5(N,p)  with 
the  distribution  represented  as  the  mixture  E p^B^(N,p). 

The  method  of  moments  has  often  proven  tractable  in  estimat  ng  the 
parameters  of  mixtures  of  distributions,  and  might  be  purs>  ed  in  dealing 
with  convoluted  binomial  distributions.  We  choose  instead  to  examine 
maximum  likelihood  estimation  in  the  next  section,  where  we  are  able  to 
directly  apply  the  system  of  differential  equations  (2.3)  satisfied  by 
convoluted  binomial  distributions. 

III.  MAXIMUM  LIKELIHOOD  ESTIMATION 

Estimation  of  the  parameters  of  a convolution  by  the  method  of 
maximum  likelihood  is  in  general  a formidable  analytic  problem.  For  con- 
volutions of  discrete  distributions,  the  likelihood  function  for  a random 
sample  is  a product  of  somewhat  unmanageable  sums,  and  a straightforward 
approach  to  maximization  seems  unpromising.  Sclove  and  Van  Ryzin  (1969) 
proposed  method  of  moments  estimators  for  parameters  of  convolutions, 
showing  this  approach  to  be  tractable  in  many  examples.  Samaniego  (1976) 
showed  that  maximum  likelihood  estimation  was  feasible  for  certain  con- 


voluted Poisson  distributions.  We  show  here  that  a similar  analysis  can 
be  applied  to  convoluted  binomial  distributions. 
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Let  X,,...,X  be  a sample  of  size  n from  a convoluted  binomial 
distribution  with  known  noise  distribution.  Denote  the  likelihood 
function  for  this  sample  by 


L(Xj^,X2,...,x^,P)  = n P(X^=x^|N,p) 

i=l 


The  characterization  result  established  in  Section  II  implies  that  the 
likelihood  equation  ^ Xn  L = 0 may  be  written  as 


n P(X  =x.-1|n-1,p)  - P(X.=x  1n-1,p) 

V ^ ^ ^ ^ — n 

P(X.=xJn,p)  = • 


(3.1) 


We  define  below,  for  a parametric  family  of  probability  distributions,  the 
notion  of  parametric  monotone  ratio  in  a given  parameter. 


Definition.  Let  fP  , p € P,  a 6 C}  be  a family  of  discrete  distributions 

'•  p,^  r 1 J 

indexed  by  a real  parameter  p and  a (possibly  degenerate)  parameter  a. 
The  family  is  said  to  have  parametric  monotone  decreasing  ratio  (PMDR)  in 
p if,  for  each  fixed  a and  for  each  a < b for  which  a and  b receive 
positive  mass  under  P^  ^ for  every  p € P,  the  ratio 


P (X=a) 

P (X=b) 

P,a 


is  decreasing  in  p. 

The  relevance  of  PMDR  to  maximum  likelihood  estimation  for  convoluted 
binomial  distributions  is  seen  in  the  following  lemma. 
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Lfinina  1 . Let  Xp...,X^^  be  a random  sample  from  a convoluted  binomial 
distribution  having  PMDR  in  the  parameter  p.  Then  ^ £n  L is  a de- 
creasing function  of  p. 


Proof;  Since  ^ Xn  L - Z ^ £n  P(X.=x.|n,p)  we  need  only  show  that 
op  i-1  1 i' 

^ £n  P is  decreasing  in  p.  We  assume  that  x is  such  that 

op 

P(X=x|p,N)  > 0 for  some  (and  thus  for  all)  p € (0,1).  Note  that 


P(X=x-l|N-l,p)  - P(X=x|N-1, 
P(X=x|N,p) 


P(X=x-1  N-1, 


- P(X=xIn-1, 


pP(X=x-llN-l,p)  + (1-p)P(X=x|N-1,p) 


P(X=x-1  N-1, 


- P(X=x  N-1, 


p(P(X=x-l |N-l,p)  - P(X=x|N-l,p))  + P(X=x|N-1,p) 


(3.2) 


For  a convoluted  binomial  distribution  with  P(X=x|N,p)  > 0,  the  terms 
P(X=x1n-1,p)  and  P(X=x-l jw-l.p)  cannot  both  be  identically  zero  in  p. 
Moreover,  because  of  PMDR,  their  difference  can  be  zero  for  at  most  one 
value  of  p.  If  the  numerator  is  positive  (negative)  for  all  p € (0,1), 
then  (3.2)  is  a decreasing  positive  (negative)  function  of  p.  This  includes 
the  cases  where  either  P(X=x|N-l,p)  = 0 or  P(X=x-l |n-1 ,p)  = 0 for  which 
(3.2)  is  equal  to  1/p  and  -l/(l-p)  respectively.  Assume  that 
P(X=x|n-1,p)  > 0 and  suppose  the  numerator  of  (3.2)  is  zero  at  p^  € (0,1). 

The  ratio  (3.2)  is  of  the  form 


1 1 


Kp) 

g(p)  + r(p) 

For  p < Py,  h(p)/g(p)  = 1/p  and  h(p)/f(p)  is  noridccreasing,  so  that 
(3.2)  is  positive  and  decreasing  for  p < p^.  The  ratio  (3.2)  equals  zero 
at  p = Pq.  For  p > p^,  we  find  that  (3.2)  is  negative  and  decreasing,  so 
that  (3.2)  is  decreasing  for  p € (0,1).  This  shows  that  ^ £n  P is  a 


dp 


decreasing  function  of  p,  completing  the  proof. 


Suppose  a random  sample  is  taken  from  a convoluted  binomial  distribu- 
tion with  PMDR  in  p,  and  suppose  that  the  noise  distribution  is  known. 

Lemma  1 implies  that  the  likelihood  equation  ^ ;in  L = 0 has  at  most  one 

dp 

root.  Thus,  the  maximum  likelihood  estimate  of  p is  zero,  one  or  the 

unique  solution  of  the  likelihood  equation.  The  MLE  may  easily  be  obtained 

numerically  for  such  problems,  since  values  of  £n  L above  and  below  zero 

dp 

may  be  used  to  approximate  the  MLE  to  any  desired  degree  of  accuracy. 

We  consider  below  examples  of  convoluted  binomial  distributions  with 
PMDR.  In  Example  1,  we  will  invoke  the  following  result. 


Lemma  2.  Let  be  a sequence  of  positive  functions  of  a real  variable 

y defined  on  an  interval  of  the  real  line.  Let  {a^},  be  sequences 

of  positive  real  numbers,  with  a^  < for  all  i.  If  f^(y)/fj(y)  is 

decreasing  in  y whenever  i < j,  then  every  function  of  the  form 


..  , , 1=1  i i 

Hn(y)  = , 

i=l  \ 


ki  < k^ 


Xn’ 


for  n > 2,  is  decreasing  in  y. 


Proof : To  start  an  induction  argument,  we  consider  n = 2, 


H / N - 1 1 2 ‘^2 


which  is  of  the  form 


JiM- 


ch(y)  + g(y) 


Since  h(y)/g(y)  is  decreasing  by  hypothesis,  we  have  that  H2(y)  is  de- 
creasing in  y.  Suppose  the  lenana  holds  for  n.  Then 


n+1 

1=1  1 1 

n+1 

1=1  1 1 1 


•;  k 


n+1’ 


n+1 

,""k/k/y) 

1=1  i 1 


,xH-l  ^ n+1 

^k  V ^ \ ^k  <yV  + ^ ^‘’k  ^k 

1 i=l  i i i=2  *^i  1 i i 


which  is  decreasing  in  y if 


which  is  decreasiiit;  in  p if  the  ratio  (3.4)  below  is  decreasing 
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x-1  x-i-1 

r . . p„(Y=i) 


i=0 


(x-i-l)l  p' 


x-1  x-i 
I P„(Y=i) 


(3.4) 


ilo  P 


since  when  P^(Y=x)  ;•  0,  P^(Y=i)/Pp(Y=x)  is  decreasing  in  p for  all 
i X for  which  P^(Y=i)  > 0.  The  function  (3.4)  satisfies  the  hypotheses 
of  Lemma  2,  being  of  the  form  H^^j^(p),  with 


f^(p)  = Pp(Y=i) 


b. 


e 


x-i-1 


(x-i-1)! 


and 


^i  x-i 


for  = i = 0,1,..., n,  where  n = min(N,x-l).  It  follows  that  the  function 
(3.4),  and  consequently  the  ratio  P(X=x-l)/P(X=x) , is  decreasing  in  p. 
Thus,  the  binomial-Poisson  convolution  has  PMDR  in  the  parameter  p. 


Example  2.  Let  Y be  a random  variable  whose  distribution  belongs  to  a family 
of  distributions  with  PMDR  in  the  parameter  p.  Suppose  the  support  of  Y 
is  equal  to  a set  of  consecutive  positive  integers.  If  Z ~ B(l,n)  and  Y 
and  Z are  independent,  we  show  that  the  distribution  of  X = Y + Z has 
PMDR  in  p.  It  is  easy  to  check  that  P(X=a)/p(X=a+l)  is  decreasing,  where 
a is  the  smallest  integer  that  Y takes  on  with  positive  probability. 

For  any  y > a for  which  P(Y=y)  > 0, 


P(X=y-l)  _ nP(Y=v-2)  + (l-Ti)P(Y=y-l ) 
P(X=y)  “ nP(Y=y-l)  + (l-n)P(Y=y) 


P(I.=JLi2) 

p(Y=y--i) 


+ (1-Tl) 


+ (1-n) 


p(Y=y) 

P(Y=y-l) 


which  is  decreasing  in  p.  If  Y has  finite  support  and 
N = lub{Support  of  y],  then 


P(X=N)  _ nP(Y=N-l)  + (l-n)P(Y=N) 
P(X=N+1)  ' rrP(Y=N) 


which  is  also  decreasing  in  p.  Thus  X = Y + Z has  a distribution  with 
PMDR.  Since  the  support  of  X is  again  equal  to  a set  of  consecutive 
positive  integers,  it  follows  that  the  distribution  of  X = Y + Z,  with 
Y,  Z independent  and  Z ~ BCMjTT)  has  PMDR  in  p.  Moreover,  since  the 
binomial  distribution  itself  has  support  on  consecutive  positive  integers, 
we  conclude  that  convoluted  binomial  distributions  B(N,p)  * B(M,ti)  have  PMDR 
in  p. 

In  Example  2,  we  require  the  support  of  a random  variable  Y to  consist 
of  a set  of  consecutive  integers  for  convolution  with  a Bernoulli  variable 
to  be  PMDR  preserving.  The  necessity  of  this  condition  is  apparent  in  the 
following  example. 


Example  3.  Let  Y be  a variable  taking  values  on  even  integers  according 
to  binomial  probabilities  as  follows: 


u. 


P(Y=2i)  = (^)P^l-p)^'"  i=0,l,...,N. 

It  is  obvious  that  the  distribution  of  Y has  PMDR.  Let  Z ~ be 

independent  of  Y,  and  let  X = Y + Z.  For  even  x, 

P(x-l)  ^ ttP(Y=x-2) 

P(x)  (l-Ti)P(Y=x) 

which  is  decreasing  in  p.  If,  however,  x is  odd, 

P(x-l)  ^ (l-n)P(Y=x-l)  ^ 

P(x)  nP(Y=x-l)  TT  • 

Thus,  the  ratio  P(x-1)/P(x)  is  a constant  independent  of  p. 


In  Examples  1 and  2 given  above,  PMDR  in  the  parameter  p guarantees 
that  the  likelihood  equation  has  at  most  one  solution.  As  we  shall  see, 
there  are  convoluted  binomial  distributions  without  PMDR,  but  satisfying 
a weaker  monotonicity  property,  for  which  we  are  able  to  demonstrate  that 
the  likelihood  equation  has  at  most  one  root. 


Definition.  Let{Pp^^,  p€R  a € fl}  be  a family  of  discrete  distributions 

indexed  by  a real  valued  parameter  p and  a (possibly  degenerate)  parameter 

a.  The  family  is  said  to  have  weak  parametric  monotone  decreasing  ratio  in  p 

(WPMDR)  if,  for  each  fixed  a € Q and  for  each  a < b for  which  a and  b 

receive  positive  mass  under  P for  every  p 6 P.  the  ratio 

p.a 


is  either  decreasing  in  p or  is  equal  to  a constant  independent  oi  p. 


Lemma  3 . Let  be  a random  sample  from  a convoluted  binomial 

distribution  ^ with  WPMDR  in  the  parameter  p.  If  at  least  one 

observation  x.  is  such  that 
1 


P(X=x.-11n-1,p)/P(X=x. 1n-i,p) 


(3.5) 


is  not  identically  1 in  p,  then  — Xn  L is  a decreasing  function  of  p. 


Proof:  For  every  observation  Xj^  for  which  the  ratio  (3.5)  is  identically  1, 

we  have  — f.n  P(X=x^|n,p)  = 0 by  (2.3).  For  any  observation  x^  for  which 
the  ratio  (3.5)  is  decreasing,  ^ Xn  P(X=x^|n,p)  is  decreasing  by  the 
argument  used  in  proving  Lemma  1.  It  remains  to  show  that  this  is  also 
true  for  observations  x^  for  which  the  ratio  (3.5)  is  identically  equal 
to  c ^ 1.  We  have 


^ In  P(X=xJn,p)  = 


P(X=x^-1|n-1,p)  - P(X=x^|n-1,p) 
P(P(X=x.-1|n-1,p)  - P(X=x^1n-1,p))  + P(X=x^|n-1,p)  ■ 


This  function  has  the  form 


g(p)  + f(p) 

wlicre  h(p)/g(p)  2 1/p  and  h(p)/f(p)  = c - 1.  Thus  ^ in  P(X=x^|n,p)  is 

decreasing  in  p.  Since  ^ £n  L is  the  sum  of  functions  each  of  which  is 

op 

decreasing  in  p or  identically  zero,  and  at  least  one  of  these  functions 
is  decreasing,  we  have  that  ^ £n  L is  decreasing  in  p. 


IK 


Wo  now  considor  a pair  of  examples  of  convolutid  binomial  distributions 
with  weak  PMDR. 

Example  4.  Let  Y be  a nonnegative  integer  valued  random  variable  whose 
distribution  belongs  to  a one-parameter  family  of  distributions  with  WPMDR 
in  the  parameter  p.  If  Z is  Independently  distributed  according  to  a 
geometric  distribution  G(0)  with  probability  mass  function 

f(Z=z)  = (i-e)e^  z=0,l,2,...  , 

we  show  that  the  distribution  of  X = Y + Z has  WPMDR  in  p.  Suppose  x - 1 
and  X receive  positive  mass  from  the  distribution  of  X.  If  P(Y=x)  = 0, 

X- 1 

E (l-e)e^P  (Y=x-i-l) 

P(X=x-l)  ^ i^O ^ 

P(X=x)  x 

E (1-0)0^P  (Y=x-i) 

i=0  P 

is  equal  to  1/0.  If  Pp(Y=x)  > 0,  this  ratio  may  be  written  in  the  form 

h.Cp) 

g(p)  + f(p)  ’ 

where  h(p)/g(p)  = 1/0  and  h(p)/f(p)  is  either  constant  or  a decreasing 
function  of  p.  Thus,  for  all  appropriate  x,  the  ratio 

P(X=x-l)/P(X=x)  is  either  constant  or  decreasing  in  p,  establishing  WPMDR 
for  the  distribution  of  X.  Since  the  binomial  distribution  B(N,p)  has 
WPMDR,  we  have  that  the  convolution  B(N,p)  * G(0)  has  WPMDR.  It  is  useful 
to  note  that  for  X ~ B(N,p)  * G(0),  the  ratio  P(x-1)/P(x)  is  decreasing 
in  p for  0 < X < N,  and  is  equal  to  1/0  for  x '>  N.  Since  this  ratio  is 
never  identically  1,  the  function  ~ £n  L is  decreasing  in  p for  samples 
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from  B(N,p)  * G(0). 

Consider  the  convolution  B(N,p)  * G(0)  * G(0),  or  equivalently 


the  convolution  B(N,p)  * riB(2,0)  where  UB(r,0)  represents  the  negative 
binomial  distribution  with  parameters  r and  0.  If  Y ~ B(N,p)  * G(0) 
and  Z ~ G(0),  independent  of  Y,  and  if  X = Y + Z,  then  for  any  x > 0 


X- 1 . 

Z 0^P(Y=x-i-l) 


P(x-l)  _ i=0 

PCx"*  ~ x-1  . 


0 Z 0 P(Y=x-i-l)  + P (Y=x) 


Because  Pp(Y=x)  is  positive  for  all  p,  this  ratio  is  decreasing  in  p. 
Thus  the  family  B(N,p)  * flB(2,0)  has  PMDR  in  p.  The  same  argument  leads 
one  to  conclude  that  B(N,p)  * hB(r,0)  has  PMDR  in  p for  any  r > 2. 


Example  5.  Let  Y ~B(N,p),  and  let  Z be  a random  variable  independent 
of  Y with  a uniform  distribution  on  the  integers  0,1,..., k,  that  is,  with 
probability  mass  function 


P(Z-z)  - 


z — 0 ,l,...,k. 


Suppose  k > N.  Then  the  ratio 

’•  x-1 

Z P (Y=i) 

i=0  P 

X 

Z P (Y=i) 
i=0  P 


if  X < N 


P-Cx.-lj, 

P(x) 


if  N < X < k 


Z P (Y=i) 
i=x-k-l  P 
N 

Z P (Y=i) 
i=x-k  ^ 


ifk<x<N  + k 


?() 


It  is  lliorcfori'  clear  that  the  distribution  of  X lias  WPMDR. 

Suppose  a random  sample  of  size  n is  taken  from  the  convolution 

B(N,p)  *UI0,l,...,kJ,  with  k>N.  The  distribution  B(N-l,p)  * U(0,l,...,kl 

has  WPMDR  in  p.  If  at  least  one  observation  fails  to  satisfy  the  inequality 

N < X.  k,  we  have  by  Lemma  3 that  ^ jCn  L is  decreasing  in  p.  If  all 
— 1 — dp 

Xj^  < k with  at  least  one  x^  < N,  the  MLE  is  p = 0.  If  all  x^^  > N with  at 
least  one  x.  > k,  the  MLE  is  p = 1.  In  all  other  cases  in  which  observations 
x^  smaller  than  N or  larger  than  k exist,  the  MLE  is  the  unique  solution 
of  the  likelihood  equation.  When  all  x^  satisfy  N < x^  < k,  the  likelihood 
is  equal  to  the  constant  (1/k+l)*^,  independent  of  p,  and  any  p € [0,1]  is 
an  MLE. 

For  completeness,  we  note  that  the  convolution  lB(N,p)  *U[0,...,k]  for 

k < N has  PMDR  in  p.  Thus,  if  a sample  is  taken  from  B(N,p)  * U[0,...,k] 

with  k N,  we  have  by  Lemma  1 that  £n  L is  decreasing  in  p and  the 

dp 

MLE  is  zero,  one  or  the  unique  solution  of  the  likelihood  equation. 

The  examples  given  in  this  section  serve  to  demonstrate  that  maximum 
likelihood  estimation  of  the  binomial  parameter  is  easily  accomplished  for 
a fairly  broad  class  of  convoluted  binomial  distributions.  In  addition 
to  the  distributions  examined  in  these  examples,  multiple  convolutions  of 
B(N,p)  with  two  or  more  other  families,  for  example,  the  three-fold 
convolution  B(N,p)  * P(0)  * hB(r,n),  can  be  shown  to  have  PMDR  in  p.  In 
all  such  cases,  with  all  parameters  save  the  binomial  parameter  p known, 
the  maximum  likelihood  estimate  of  p is  easily  found.  It  is  possible  to 


construct  convoluted  binomial  distributions  for  which  the  equation 

£n  L = 0 has  any  number  of  solutions.  Thus,  maximum  likelihood 
dp 
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estimation  can  be  unfeasible  for  such  models  even  by  numerical  methods. 
When  parametric  monotonicity  holds,  the  estimation  problem  is  entirely 
straightforward . 
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